SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Lau, Lester F. 

(ii) TITLE OF INVENTION: Extracellular Matrix Signalling Molecules 
(iii) NUMBER OF SEQUENCES: 17 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & Borun

(B) STREET: 6300 Sears Tower, 233 South Wacker Drive 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: United States of America 

(F) ZIP: 60606-6402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Clough, David W. 

(B) REGISTRATION NUMBER: 36,107 

(C) REFERENCE/DOCKET NUMBER: 28758/33766 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312/474-6300 

(B) TELEFAX: 312/474-0448 

(C) TELEX: 25-3856 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 180..1316



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(D) OTHER INFORMATION: "Mouse cyr61 cDNA coding sequence" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CGAGAGCGCC CCAGAGAAGC GCCTGCAATC TCTGCGCCTC CTCCGCCAGC ACCTCGAGAG 60 

AAGGACACCC GCCGCCTCGG CCCTCGCCTC ACCGCACTCC GGGCGCATTT GATCCCGCTG 120 

CTCGCCGGCT TGTTGGTTCT GTGTCGCCGC GCTCGCCCCG GTTCCTCCTG CGCGCCACA 179 

ATG AGC TCC AGC ACC TTC AGG ACG CTC GCT GTC GCC GTC ACC CTT CTC 227
Met Ser Ser Ser Thr Phe Arg Thr Leu Ala Val Ala Val Thr Leu Leu
1 5 10 15

CAC TTG ACC AGA CTG GCG CTC TCC ACC TGC CCC GCC GCC TGC CAC TGC 275
His Leu Thr Arg Leu Ala Leu Ser Thr Cys Pro Ala Ala Cys His Cys
20 25 30

CCT CTG GAG GCA CCC AAG TGC GCC CCG GGA GTC GGG TTG GTC CGG GAC 323
Pro Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu Val Arg Asp
35 40 45

GGC TGC GGC TGC TGT AAG GTC TGC GCT AAA CAA CTC AAC GAG GAC TGC 371
Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gln Leu Asn Glu Asp Cys
50 55 60

AGC AAA ACT CAG CCC TGC GAC CAC ACC AAG GGG TTG GAA TGC AAT TTC 419
Ser Lys Thr Gln Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn Phe
65 70 75 80

GGC GCC AGC TCC ACC GCT CTG AAA GGG ATC TGC AGA GCT CAG TCA GAA 467
Gly Ala Ser Ser Thr Ala Leu Lys Gly Ile Cys Arg Ala Gln Ser Glu
85 90 95

GGC AGA CCC TGT GAA TAT AAC TCC AGA ATC TAC CAA AAC GGG GAA AGC 515
Gly Arg Pro Cys Glu Tyr Asn Ser Arg Ile Tyr Gln Asn Gly Glu Ser
100 105 110

TTC CAG CCC AAC TGT AAA CAC CAG TGC ACA TGT ATT GAT GGC GCC GTG 563
Phe Gln Pro Asn Cys Lys His Gln Cys Thr Cys Ile Asp Gly Ala Val
115 120 125

GGC TGC ATT CCT CTG TGT CCC CAA GAA CTG TCT CTC CCC AAT CTG GGC 611
Gly Cys Ile Pro Leu Cys Pro Gln Glu Leu Ser Leu Pro Asn Leu Gly
130 135 140

TGT CCC AAC CCC CGG CTG GTG AAA GTC AGC GGG CAG TGC TGT GAA GAG 659
Cys Pro Asn Pro Arg Leu Val Lys Val Ser Gly Gln Cys Cys Glu Glu
145 150 155 160

TGG GTT TGT GAT GAA GAC AGC ATT AAG GAC TCC CTG GAC GAC CAG GAT 707
Trp Val Cys Asp Glu Asp Ser Ile Lys Asp Ser Leu Asp Asp Gln Asp
165 170 175

GAC CTC CTC GGA CTC GAT GCC TCG GAG GTG GAG TTA ACG AGA AAC AAT 755
Asp Leu Leu Gly Leu Asp Ala Ser Glu Val Glu Leu Thr Arg Asn Asn
180 185 190

GAG TTA ATC GCA ATT GGA AAA GGC AGC TCA CTG AAG AGG CTT CCT GTC 803
Glu Leu Ile Ala Ile Gly Lys Gly Ser Ser Leu Lys Arg Leu Pro Val
195 200 205

TTT GGC ACC GAA CCG CGA GTT CTT TTC AAC CCT CTG CAC GCC CAT GGC 851
Phe Gly Thr Glu Pro Arg Val Leu Phe Asn Pro Leu His Ala His Gly
210 215 220

CAG AAA TGC ATC GTT CAG ACC ACG TCT TGG TCC CAG TGC TCC AAG AGC 899
Gln Lys Cys Ile Val Gln Thr Thr Ser Trp Ser Gln Cys Ser Lys Ser
225 230 235 240

TGC GGA ACT GGC ATC TCC ACA CGA GTT ACC AAT GAC AAC CCA GAG TGC 947
Cys Gly Thr Gly Ile Ser Thr Arg Val Thr Asn Asp Asn Pro Glu Cys
245 250 255

CGC CTG GTG AAA GAG ACC CGG ATC TGT GAA GTG CGT CCT TGT GGA CAA 995
Arg Leu Val Lys Glu Thr Arg Ile Cys Glu Val Arg Pro Cys Gly Gln
260 265 270

CCA GTG TAC AGC AGC CTA AAA AAG GGC AAG AAA TGC AGC AAG ACC AAG 1043
Pro Val Tyr Ser Ser Leu Lys Lys Gly Lys Lys Cys Ser Lys Thr Lys
275 280 285

AAA TCC CCA GAA CCA GTC AGA TTT ACT TAT GCA GGA TGC TCC AGT GTC 1091
Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys Ser Ser Val
290 295 300

AAG AAA TAC CGG CCC AAA TAC TGC GGC TCC TGC GTA GAT GGC CGG TGC 1139
Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp Gly Arg Cys
305 310 315 320

TGC ACA CCT CTG CAG ACC AGA ACT GTG AAG ATG CGG TTC CGA TGC GAA 1187
Cys Thr Pro Leu Gln Thr Arg Thr Val Lys Met Arg Phe Arg Cys Glu
325 330 335

GAT GGA GAG ATG TTT TCC AAG AAT GTC ATG ATG ATC CAG TCC TGC AAA 1235
Asp Gly Glu Met Phe Ser Lys Asn Val Met Met Ile Gln Ser Cys Lys
340 345 350

TGT AAC TAC AAC TGC CCG CAT CCC AAC GAG GCA TCG TTC CGA CTG TAC 1283
Cys Asn Tyr Asn Cys Pro His Pro Asn Glu Ala Ser Phe Arg Leu Tyr
355 360 365

AGC CTA TTC AAT GAC ATC CAC AAG TTC AGG GAC TAAGTGCCTC CAGGGTTCCT 1336
Ser Leu Phe Asn Asp Ile His Lys Phe Arg Asp
370 375

AGTGTGGGCT GGACAGAGGA GAAGCGCAAG CATCATGGAG ACGTGGGTGG GCGGAGGATG 1396

AATGGTGCCT TGCTCATTCT TGAGTAGCAT TAGGGTATTT CAAAACTGCC AAGGGGCTGA 1456

TGTGGACGGA CAGCAGCGCA GCCG 1480
(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 379 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(D) OTHER INFORMATION: "Mouse Cyr61 amino acid sequence" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Ser Ser Thr Phe Arg Thr Leu Ala Val Ala Val Thr Leu Leu
1 5 10 15

His Leu Thr Arg Leu Ala Leu Ser Thr Cys Pro Ala Ala Cys His Cys
20 25 30

Pro Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu Val Arg Asp
35 40 45

Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gln Leu Asn Glu Asp Cys
50 55 60

Ser Lys Thr Gln Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn Phe
65 70 75 80

Gly Ala Ser Ser Thr Ala Leu Lys Gly Ile Cys Arg Ala Gln Ser Glu
85 90 95

Gly Arg Pro Cys Glu Tyr Asn Ser Arg Ile Tyr Gln Asn Gly Glu Ser
100 105 110

Phe Gln Pro Asn Cys Lys His Gln Cys Thr Cys Ile Asp Gly Ala Val
115 120 125

Gly Cys Ile Pro Leu Cys Pro Gln Glu Leu Ser Leu Pro Asn Leu Gly
130 135 140

Cys Pro Asn Pro Arg Leu Val Lys Val Ser Gly Gln Cys Cys Glu Glu
145 150 155 160

Trp Val Cys Asp Glu Asp Ser Ile Lys Asp Ser Leu Asp Asp Gln Asp
165 170 175

Asp Leu Leu Gly Leu Asp Ala Ser Glu Val Glu Leu Thr Arg Asn Asn
180 185 190

Glu Leu Ile Ala Ile Gly Lys Gly Ser Ser Leu Lys Arg Leu Pro Val
195 200 205

Phe Gly Thr Glu Pro Arg Val Leu Phe Asn Pro Leu His Ala His Gly
210 215 220

Gln Lys Cys Ile Val Gln Thr Thr Ser Trp Ser Gln Cys Ser Lys Ser
225 230 235 240

Cys Gly Thr Gly Ile Ser Thr Arg Val Thr Asn Asp Asn Pro Glu Cys
245 250 255

Arg Leu Val Lys Glu Thr Arg Ile Cys Glu Val Arg Pro Cys Gly Gln
260 265 270

Pro Val Tyr Ser Ser Leu Lys Lys Gly Lys Lys Cys Ser Lys Thr Lys
275 280 285

Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys Ser Ser Val
290 295 300

Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp Gly Arg Cys
305 310 315 320

Cys Thr Pro Leu Gln Thr Arg Thr Val Lys Met Arg Phe Arg Cys Glu
325 330 335

Asp Gly Glu Met Phe Ser Lys Asn Val Met Met Ile Gln Ser Cys Lys
340 345 350

Cys Asn Tyr Asn Cys Pro His Pro Asn Glu Ala Ser Phe Arg Leu Tyr
355 360 365

Ser Leu Phe Asn Asp Ile His Lys Phe Arg Asp
370 375

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1418 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 124..1266

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(D) OTHER INFORMATION: "Human cyr61 cDNA coding sequence"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGGCGGGCCC ACCGCGACAC CGCGCCGCCA CCCCGACCCC GCTGCGCACG GCCTGTCCGC 60 

TGCACACCAG CTTGTTGGCG TCTTCGTCGC CGCGCTCGCC CCGGGCTACT CCTGCGCGCC 120 

ACA ATG AGC TCC CGC ATC GCC AGG GCG CTC GCC TTA GTC GTC ACC CTT 168
Met Ser Ser Arg Ile Ala Arg Ala Leu Ala Leu Val Val Thr Leu
1 5 10 15

CTC CAC TTG ACC AGG CTG GCG CTC TCC ACC TGC CCC GCT GCC TGC CAC 216
Leu His Leu Thr Arg Leu Ala Leu Ser Thr Cys Pro Ala Ala Cys His
20 25 30

TGC CCC CTG GAG GCG CCC AAG TGC GCG CCG GGA GTC GGG CTG GTC CGG 264
Cys Pro Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu Val Arg
35 40 45

GAC GGC TGC GGC TGC TGT AAG GTC TGC GCC AAG CAG CTC AAC GAG GAC 312
Asp Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gln Leu Asn Glu Asp
50 55 60

TGC AGC AAA ACG CAG CCC TGC GAC CAC ACC AAG GGG CTG GAA TGC AAC 360
Cys Ser Lys Thr Gln Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn
65 70 75

TTC GGC GCC AGC TCC ACC GCT CTG AAG GGG ATC TGC AGA GCT CAG TCA 408
Phe Gly Ala Ser Ser Thr Ala Leu Lys Gly Ile Cys Arg Ala Gln Ser
80 85 90 95

GAG GGC AGA CCC TGT GAA TAT AAC TCC AGA ATC TAC CAA AAC GGG GAA 456
Glu Gly Arg Pro Cys Glu Tyr Asn Ser Arg Ile Tyr Gln Asn Gly Glu
100 105 110

AGT TTC CAG CCC AAC TGT CAA CAT CAG TGC ACA TGT ATT GAT GGC GCC 504
Ser Phe Gln Pro Asn Cys Gln His Gln Cys Thr Cys Ile Asp Gly Ala
115 120 125

GTG GGC TGC ATT CCT CTG TGT CCC CAA GAA CTA TCT CTC CCC AAC TTG 552
Val Gly Cys Ile Pro Leu Cys Pro Gln Glu Leu Ser Leu Pro Asn Leu
130 135 140

GGC TGT CCC AAC CCT CGG CTG GTC AAA GTT ACC GGG CAG TGC TGC GAG 600
Gly Cys Pro Asn Pro Arg Leu Val Lys Val Thr Gly Gln Cys Cys Glu
145 150 155

GAG TGG GTC TGT GAC GAG GAT AGT ATC AAG GAC CCC ATG GAG GAC CAG 648
Glu Trp Val Cys Asp Glu Asp Ser Ile Lys Asp Pro Met Glu Asp Gln
160 165 170 175

GAC GGC CTC CTT GGC AAG GAG CTG GGA TTC GAT GCC TCC GAG GTG GAG 696
Asp Gly Leu Leu Gly Lys Glu Leu Gly Phe Asp Ala Ser Glu Val Glu
180 185 190

TTG ACG AGA AAC AAT GAA TTG ATT GCA GTT GGA AAA GGC AGA TCA CTG 744
Leu Thr Arg Asn Asn Glu Leu Ile Ala Val Gly Lys Gly Arg Ser Leu
195 200 205

AAG CGG CTC CCT GTT TTT GGA ATG GAG CCT CGC ATC CTA TAC AAC CCT 792
Lys Arg Leu Pro Val Phe Gly Met Glu Pro Arg Ile Leu Tyr Asn Pro
210 215 220

TTA CAA GGC CAG AAA TGT ATT GTT CAA ACA ACT TCA TGG TCC CAG TGC 840
Leu Gln Gly Gln Lys Cys Ile Val Gln Thr Thr Ser Trp Ser Gln Cys
225 230 235

TCA AAG ACC TGT GGA ACT GGT ATC TCC ACA CGA GTT ACC AAT GAC AAC 888
Ser Lys Thr Cys Gly Thr Gly Ile Ser Thr Arg Val Thr Asn Asp Asn
240 245 250 255

CCT GAG TGC CGC CTT GTG AAA GAA ACC CGG ATT TGT GAG GTG CGG CCT 936
Pro Glu Cys Arg Leu Val Lys Glu Thr Arg Ile Cys Glu Val Arg Pro
260 265 270

TGT GGA CAG CCA GTG TAC AGC AGC CTG AAA AAG GGC AAG AAA TGC AGC 984
Cys Gly Gln Pro Val Tyr Ser Ser Leu Lys Lys Gly Lys Lys Cys Ser
275 280 285

AAG ACC AAG AAA TCC CCC GAA CCA GTC AGG TTT ACT TAC GCT GGA TGT 1032
Lys Thr Lys Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys
290 295 300

TTG AGT GTG AAG AAA TAC CGG CCC AAG TAC TGC GGT TCC TGC GTG GAC 1080
Leu Ser Val Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp
305 310 315

GGC CGA TGC TGC ACG CCC CAG CTG ACC AGG ACT GTG AAG ATG CGG TTC 1128
Gly Arg Cys Cys Thr Pro Gln Leu Thr Arg Thr Val Lys Met Arg Phe
320 325 330 335

CGC TGC GAA GAT GGG GAG ACA TTT TCC AAG AAC GTC ATG ATG ATC CAG 1176
Arg Cys Glu Asp Gly Glu Thr Phe Ser Lys Asn Val Met Met Ile Gln
340 345 350

TCC TGC AAA TGC AAC TAC AAC TGC CCG CAT GCC AAT GAA GCA GCG TTT 1224
Ser Cys Lys Cys Asn Tyr Asn Cys Pro His Ala Asn Glu Ala Ala Phe
355 360 365

CCC TTC TAC AGG CTG TTC AAT GAC ATT CAC AAA TTT AGG GAC 1266
Pro Phe Tyr Arg Leu Phe Asn Asp Ile His Lys Phe Arg Asp
370 375 380

TAAATGCTAC CTGGGTTTCC AGGGCACACC TAGACAAACA AGGGAGAAGA GTGTCAGAAT 1326

CAGAATCATG GAGAAAATGG GCGGGGGTGG TGTGGGTGAT GGGACTCATT GTAGAAAGGA 1386

AGCCTTCTCA TTCTTGAGGA GCATTAAGGT AT 1418



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(D) OTHER INFORMATION: "Human Cyr61 amino acid sequence" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ser Ser Arg Ile Ala Arg Ala Leu Ala Leu Val Val Thr Leu Leu
1 5 10 15

His Leu Thr Arg Leu Ala Leu Ser Thr Cys Pro Ala Ala Cys His Cys
20 25 30

Pro Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu Val Arg Asp
35 40 45

Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gln Leu Asn Glu Asp Cys
50 55 60

Ser Lys Thr Gln Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn Phe
65 70 75 80

Gly Ala Ser Ser Thr Ala Leu Lys Gly Ile Cys Arg Ala Gln Ser Glu
85 90 95

Gly Arg Pro Cys Glu Tyr Asn Ser Arg Ile Tyr Gln Asn Gly Glu Ser
100 105 110

Phe Gln Pro Asn Cys Gln His Gln Cys Thr Cys Ile Asp Gly Ala Val
115 120 125

Gly Cys Ile Pro Leu Cys Pro Gln Glu Leu Ser Leu Pro Asn Leu Gly
130 135 140

Cys Pro Asn Pro Arg Leu Val Lys Val Thr Gly Gln Cys Cys Glu Glu
145 150 155 160

Trp Val Cys Asp Glu Asp Ser Ile Lys Asp Pro Met Glu Asp Gln Asp
165 170 175

Gly Leu Leu Gly Lys Glu Leu Gly Phe Asp Ala Ser Glu Val Glu Leu
180 185 190

Thr Arg Asn Asn Glu Leu Ile Ala Val Gly Lys Gly Arg Ser Leu Lys
195 200 205

Arg Leu Pro Val Phe Gly Met Glu Pro Arg Ile Leu Tyr Asn Pro Leu
210 215 220

Gln Gly Gln Lys Cys Ile Val Gln Thr Thr Ser Trp Ser Gln Cys Ser
225 230 235 240

Lys Thr Cys Gly Thr Gly Ile Ser Thr Arg Val Thr Asn Asp Asn Pro
245 250 255

Glu Cys Arg Leu Val Lys Glu Thr Arg Ile Cys Glu Val Arg Pro Cys
260 265 270

Gly Gln Pro Val Tyr Ser Ser Leu Lys Lys Gly Lys Lys Cys Ser Lys
275 280 285

Thr Lys Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys Leu
290 295 300

Ser Val Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp Gly
305 310 315 320

Arg Cys Cys Thr Pro Gln Leu Thr Arg Thr Val Lys Met Arg Phe Arg
325 330 335

Cys Glu Asp Gly Glu Thr Phe Ser Lys Asn Val Met Met Ile Gln Ser
340 345 350

Cys Lys Cys Asn Tyr Asn Cys Pro His Ala Asn Glu Ala Ala Phe Pro
355 360 365

Phe Tyr Arg Leu Phe Asn Asp Ile His Lys Phe Arg Asp
370 375 380

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2267 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(D) OTHER INFORMATION: "Fisp12 cDNA coding sequence"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GAATTCCGCC GACAACCCCA GACGCCACCG CCTGGAGCGT CCAGACACCA ACCTCCGCCC 60

CTGTCCGAAT CCAGGCTCCA GCCGCGCCTC TCGTCGCCTC TGCACCCTGC TGTGCATCCT 120

CCTACCGCGT CCCGATCATG CTCGCCTCCG TCGCAGGTCC CATCAGCCTC GCCTTGGTGC 180

TCCTCGCCCT CTGCACCCGG CCTGCTACGG GCCAGGACTG CAGCGCGCAA TGTCAGTGCG 240

CAGCCGAAGC AGCGCCGCAC TGCCCCGCCG GCGTGAGCCT GGTGCTGGAC GGCTGCGGCT 300

GCTGCCGCGT CTGCGCCAAG CAGCTGGGAG AACTGTGTAC GGAGCGTGAC CCCTGCGACC 360

CACACAAGGG CCTCTTCTGC GATTTCGGCT CCCCCGCCAA CCGCAAGATT GGAGTGTGCA 420

CTGCCAAAGA TGGTGCACCC TGTGTCTTCG GTGGGTCGGT GTACCGCAGC GGTGAGTCCT 480

TCCAAAGCAG CTGCAAATAC CAATGCACTT GCCTGGATGG GGCCGTGGGC TGCGTGCCCC 540

TATGCAGCAT GGACGTGCGC CTGCCCAGCC CTGACTGCCC CTTCCCGAGA AGGGTCAAGC 600

TGCCTGGGAA ATGCTGCAAG GAGTGGGTGT GTGACGAGCC CAAGGACCGC ACAGCAGTTG 660

GCCCTGCCCT AGCTGCCTAC CGACTGGAAG ACACATTTGG CCCAGACCCA ACTATGATGC 720

GAGCCAACTG CCTGGTCCAG ACCACAGAGT GGAGCGCCTG TTCTAAGACC TGTGGAATGG 780

GCATCTCCAC CCGAGTTACC AATGACAATA CCTTCTGCAG ACTGGAGAAG CAGAGCCGCC 840

TCTGCATGGT CAGGCCCTGC GAAGCTGACC TGGAGGAAAA CATTAAGAAG GGCAAAAAGT 900

GCATCCGGAC ACCTAAAATC GCCAAGCCTG TCAAGTTTGA GCTTTCTGGC TGCACCAGTG 960

TGAAGACATA CAGGGCTAAG TTCTGCGGGG TGTGCACAGA CGGCCGCTGC TGCACACCGC 1020

ACAGAACCAC CACTCTGCCA GTGGAGTTCA AATGCCCCGA TGGCGAGATC ATGAAAAAGA 1080

ATATGATGTT CATCAAGACC TGTGCCTGCC ATTACAACTG TCCTGGGGAC AATGACATCT 1140

TTGAGTCCCT GTACTACAGG AAGATGTACG GAGACATGGC GTAAAGCCAG GAAGTAAGGG 1200

ACACGAACTC ATTAGACTAT AACTTGAACT GAGTTGCATC TCATTTTCTT CTGTAAAAAC 1260

AATTACAGTA GCACATTAAT TTAAATCTGT GTTTTTAACT ACCGTGGGAG GAACTATCCC 1320

ACCAAAGTGA GAACGTTATG TCATGGCCAT ACAAGTAGTC TGTCAACCTC AGACACTGGT 1380

TTCGAGACAG TTTACACTTG ACAGTTGTTC ATTAGCGCAC AGTGCCAGAA CGCACACTGA 1440

GGTGAGTCTC CTGGAACAGT GGAGATGCCA GGAGAAAGAA AGACAGGTAC TAGCTGAGGT 1500

TATTTTAAAA GCAGCAGTGT GCCTACTTTT TGGAGTGTAA CCGGGGAGGG AAATTATAGC 1560

ATGCTTGCAG ACAGACCTGC TCTAGCGAGA GCTGAGCATG TGTCCTCCAC TAGATGAGGC 1620

TGAGTCCAGC TGTTCTTTAA GAACAGCAGT TTCAGCCTCT GACCATTCTG ATTCCAGTGA 1680

CACTTGTCAG GAGTCAGAGC CTTGTCTGTT AGACTGGACA GCTTGTGGCA AGTAAGTTTG 1740

CCTGTAACAA GCCAGATTTT TATTGATATT GTAAATATTG TGGATATATA TATATATATA 1800

TATATTTGTA CAGTTATCTA AGTTAATTTA AAGTCATTTG TTTTTGTTTT AAGTGCTTTT 1860

GGGATTTTAA ACTGATAGCC TCAAACTCCA AACACCATAG GTAGGACACG AAGCTTATCT 1920

GTGATTCAAA ACAAAGGAGA TACTGCAGTG GGAATTGTGA CCTGAGTGAC TCTCTGTCAG 1980

AACAAACAAA TGCTGTGCAG GTGATAAAGC TATGTATTGG AAGTCAGATT TCTAGTAGGA 2040

AATGTGGTCA AATCCCTGTT GGTGAACAAA TGGCCTTTAT TAAGAAATGG CTGGCTCAGG 2100

GTAAGGTCCG ATTCCTACCA GGAAGTGCTT GCTGCTTCTT TGATTATGAC TGGTTTGGGG 2160

TGGGGGGCAG TTTATTTGTT GAGAGTGTGA CCAAAAGTTA CATGTTTGCA CCTTTCTAGT 2220

TGAAAATAAA GTATATATAT ATTTTTTATA TGAAAAAAAA GGAATTC 2267
(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(D) OTHER INFORMATION: "Fisp12 amino acid sequence"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Leu Ala Ser Val Ala Gly Pro Ile Ser Leu Ala Leu Val Leu Leu
1 5 10 15

Ala Leu Cys Thr Arg Pro Ala Thr Gly Gln Asp Cys Ser Ala Gln Cys
20 25 30

Gln Cys Ala Ala Glu Ala Ala Pro His Cys Pro Ala Gly Val Ser Leu
35 40 45

Val Leu Asp Gly Cys Gly Cys Cys Arg Val Cys Ala Lys Gln Leu Gly
50 55 60

Glu Leu Cys Thr Glu Arg Asp Pro Cys Asp Pro His Lys Gly Leu Phe
65 70 75 80

Cys Asp Phe Gly Ser Pro Ala Asn Arg Lys Ile Gly Val Cys Thr Ala
85 90 95

Lys Asp Gly Ala Pro Cys Val Phe Gly Gly Ser Val Tyr Arg Ser Gly
100 105 110

Glu Ser Phe Gln Ser Ser Cys Lys Tyr Gln Cys Thr Cys Leu Asp Gly
115 120 125

Ala Val Gly Cys Val Pro Leu Cys Ser Met Asp Val Arg Leu Pro Ser
130 135 140

Pro Asp Cys Pro Phe Pro Arg Arg Val Lys Leu Pro Gly Lys Cys Cys
145 150 155 160

Lys Glu Trp Val Cys Asp Glu Pro Lys Asp Arg Thr Ala Val Gly Pro
165 170 175

Ala Leu Ala Ala Tyr Arg Leu Glu Asp Thr Phe Gly Pro Asp Pro Thr
180 185 190

Met Met Arg Ala Asn Cys Leu Val Gln Thr Thr Glu Trp Ser Ala Cys
195 200 205

Ser Lys Thr Cys Gly Met Gly Ile Ser Thr Arg Val Thr Asn Asp Asn
210 215 220

Thr Phe Cys Arg Leu Glu Lys Gln Ser Arg Leu Cys Met Val Arg Pro
225 230 235 240

Cys Glu Ala Asp Leu Glu Glu Asn Ile Lys Lys Gly Lys Lys Cys Ile
245 250 255

Arg Thr Pro Lys Ile Ala Lys Pro Val Lys Phe Glu Leu Ser Gly Cys
260 265 270

Thr Ser Val Lys Thr Tyr Arg Ala Lys Phe Cys Gly Val Cys Thr Asp
275 280 285

Gly Arg Cys Cys Thr Pro His Arg Thr Thr Thr Leu Pro Val Glu Phe
290 295 300

Lys Cys Pro Asp Gly Glu Ile Met Lys Lys Asn Met Met Phe Ile Lys
305 310 315 320

Thr Cys Ala Cys His Tyr Asn Cys Pro Gly Asp Asn Asp Ile Phe Glu
325 330 335

Ser Leu Tyr Tyr Arg Lys Met Tyr Gly Asp Met Ala
340 345

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2075 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(D) OTHER INFORMATION: "CTGF cDNA coding sequence"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CCCGGCCGAC AGCCCCGAGA CGACAGCCCG GCGCGTCCCG GTCCCCACCT CCGACCACCG 60

CCAGCGCTCC AGGCCCCGCG CTCCCCGCTC GCCGCCACCG CGCCCTCCGC TCCGCCCGCA 120

GTGCCAACCA TGACCGCCGC CAGTATGGGC CCCGTCCGCG TCGCCTTCGT GGTCCTCCTC 180

GCCCTCTGCA GCCGGCCGGC CGTCGGCCAG AACTGCAGCG GGCCGTGCCG GTGCCCGGAC 240

GAGCCGGCGC CGCGCTGCCC GGCGGGCGTG AGCCTCGTGC TGGACGGCTG CGGCTGCTGC 300

CGCGTCTGCG CCAAGCAGCT GGGCGAGCTG TGCACCGAGC GCGACCCCTG CGACCCGCAC 360

AAGGGCCTCT TCTGTGACTT CGGCTCCCCG GCCAACCGCA AGATCGGCGT GTGCACCGCC 420

AAAGATGGTG CTCCCTGCAT CTTCGGTGGT ACGGTGTACC GCAGCGGAGA GTCCTTCCAG 480

AGCAGCTGCA AGTACCAGTG CACGTGCCTG GACGGGGCGG TGGGCTGCAT GCCCCTGTGC 540

AGCATGGACG TTCGTCTGCC CAGCCCTGAC TGCCCCTTCC CGAGGAGGGT CAAGCTGCCC 600

GGGAAATGCT GCGAGGAGTG GGTGTGTGAC GAGCCCAAGG ACCAAACCGT GGTTGGGCCT 660

GCCCTCGCGG CTTACCGACT GGAAGACACG TTTGGCCCAG ACCCAACTAT GATTAGAGCC 720

AACTGCCTGG TCCAGACCAC AGAGTGGAGC GCCTGTTCCA AGACCTGTGG GATGGGCATC 780

TCCACCCGGG TTACCAATGA CAACGCCTCC TGCAGGCTAG AGAAGCAGAG CCGCCTGTGC 840

ATGGTCAGGC CTTGCGAAGC TGACCTGGAA GAGAACATTA AGAAGGGCAA AAAGTGCATC 900

CGTACTCCCA AAATCTCCAA GCCTATCAAG TTTGAGCTTT CTGGCTGCAC CAGCATGAAG 960

ACATACCGAG CTAAATTCTG TGGAGTATGT ACCGACGGCC GATGCTGCAC CCCCCACAGA 1020

ACCACCACCC TGCCGGTGGA GTTCAAGTGC CCTGACGGCG AGGTCATGAA GAAGAACATG 1080

ATGTTCATCA AGACCTGTGC CTGCCATTAC AACTGTCCCG GAGACAATGA CATCTTTGAA 1140

TCGCTGTACT ACAGGAAGAT GTACGGAGAC ATGGCATGAA GCCAGAGAGT GAGAGACATT 1200

AACTCATTAG ACTGGAACTT GAACTGATTC ACATCTCATT TTTCCGTAAA AATGATTTCA 1260

GTAGCACAAG TTATTTAAAT CTGTTTTTCT AACTGGGGGA AAAGATTCCC ACCCAATTCA 1320

AAACATTGTG CCATGTCAAA CAAATAGTCT ATCTTCCCCA GACACTGGTT TGAAGAATGT 1380

TAAGACTTGA CAGTGGAACT ACATTAGTAC ACAGCACCAG AATGTATATT AAGGTGTGGC 1440

TTTAGGAGCA GTGGGAGGGT ACCGGCCCGG TTAGTATCAT CAGATCGACT CTTATACGAG 1500

TAATATGCCT GCTATTTGAA GTGTAATTGA GAAGGAAAAT TTTAGCGTGC TCACTGACCT 1560

GCCTGTAGCC CCAGTGACAG CTAGGATGTG CATTCTCCAG CCATCAAGAG ACTGAGTCAA 1620

GTTGTTCCTT AAGTCAGAAC AGCAGACTCA GCTCTGACAT TCTGATTCGA ATGACACTGT 1680

TCAGGAATCG GAATCCTGTC GATTAGACTG GACAGCTTGT GGCAAGTGAA TTTGCCTGTA 1740

ACAAGCCAGA TTTTTTAAAA TTTATATTGT AAATATTGTG TGTGTGTGTG TGTGTGTATA 1800

TATATATATA TATGTACAGT TATCTAAGTT AATTTAAAGT TGTTTGTGCC TTTTTATTTT 1860

TGTTTTTAAT GCTTTGATAT TTCAATGTTA GCCTCAATTT CTGAACACCA TAGGTAGAAT 1920

GTAAAGCTTG TCTGATCGTT CAAAGCATGA AATGGATACT TATATGGAAA TTCTGCTCAG 1980

ATAGAATGAC AGTCCGTCAA AACAGATTGT TTGCAAAGGG GAGGCATCAG TGTCTTGGCA 2040

GGCTGATTTC TAGGTAGGAA ATGTGGTAGC TCACG 2075


(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 349 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(D) OTHER INFORMATION: "CTGF amino acid sequence"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Thr Ala Ala Ser Met Gly Pro Val Arg Val Ala Phe Val Val Leu
1 5 10 15

Leu Ala Leu Cys Ser Arg Pro Ala Val Gly Gln Asn Cys Ser Gly Pro
20 25 30

Cys Arg Cys Pro Asp Glu Pro Ala Pro Arg Cys Pro Ala Gly Val Ser
35 40 45

Leu Val Leu Asp Gly Cys Gly Cys Cys Arg Val Cys Ala Lys Gln Leu
50 55 60

Gly Glu Leu Cys Thr Glu Arg Asp Pro Cys Asp Pro His Lys Gly Leu
65 70 75 80

Phe Cys Asp Phe Gly Ser Pro Ala Asn Arg Lys Ile Gly Val Cys Thr
85 90 95

Ala Lys Asp Gly Ala Pro Cys Ile Phe Gly Gly Thr Val Tyr Arg Ser
100 105 110

Gly Glu Ser Phe Gln Ser Ser Cys Lys Tyr Gln Cys Thr Cys Leu Asp
115 120 125

Gly Ala Val Gly Cys Met Pro Leu Cys Ser Met Asp Val Arg Leu Pro
130 135 140

Ser Pro Asp Cys Pro Phe Pro Arg Arg Val Lys Leu Pro Gly Lys Cys
145 150 155 160

Cys Glu Glu Trp Val Cys Asp Glu Pro Lys Asp Gln Thr Val Val Gly
165 170 175

Pro Ala Leu Ala Ala Tyr Arg Leu Glu Asp Thr Phe Gly Pro Asp Pro
180 185 190

Thr Met Ile Arg Ala Asn Cys Leu Val Gln Thr Thr Glu Trp Ser Ala
195 200 205

Cys Ser Lys Thr Cys Gly Met Gly Ile Ser Thr Arg Val Thr Asn Asp
210 215 220

Asn Ala Ser Cys Arg Leu Glu Lys Gln Ser Arg Leu Cys Met Val Arg
225 230 235 240

Pro Cys Glu Ala Asp Leu Glu Glu Asn Ile Lys Lys Gly Lys Lys Cys
245 250 255

Ile Arg Thr Pro Lys Ile Ser Lys Pro Ile Lys Phe Glu Leu Ser Gly
260 265 270

Cys Thr Ser Met Lys Thr Tyr Arg Ala Lys Phe Cys Gly Val Cys Thr
275 280 285

Asp Gly Arg Cys Cys Thr Pro His Arg Thr Thr Thr Leu Pro Val Glu
290 295 300

Phe Lys Cys Pro Asp Gly Glu Val Met Lys Lys Asn Met Met Phe Ile
305 310 315 320

Lys Thr Cys Ala Cys His Tyr Asn Cys Pro Gly Asp Asn Asp Ile Phe
325 330 335

Glu Ser Leu Tyr Tyr Arg Lys Met Tyr Gly Asp Met Ala
340 345

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGGGATCTGT GACGAGCCCA AGGAC 25 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGGAATTCGA CCAGGCAGTT GGCTCG 26 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GGGGATCCTG TGATGAAGAC AGCATT 26

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGGAATTCAA CGATGCATTT CTGGCC 26 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Asp Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gln Leu Asn Glu Asp
1 5 10 15

Cys Ser Lys Thr Gln
20

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Asn Cys Lys His Gln Cys Thr Cys Ile Asp Gly Ala Val Gly Cys
1 5 10 15

Ile Pro Leu Cys Pro
20

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Cys Ile Val Gln Thr Thr Ser Trp Ser Gln Cys Ser Lys Ser Cys Gly
1 5 10 15

Thr Gly Ile Ser Thr Arg Val Thr
20

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Ile Ser Thr Arg Val Thr Asn Asp Asn Pro Glu Cys Arg Leu Val Lys
1 5 10 15

Glu Thr Arg Ile Cys Glu Val Arg Pro Cys
20 25

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 



(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Lys Tyr Cys Gly Ser Cys Val Asp Gly Arg Cys Cys Thr Pro Leu Gln
1 5 10 15

Thr Arg Thr Val Lys
20



